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Microphone Array for Preserving Soundfield Perceptual Cues 



Related Application 

This invention claim priority from provisional application No. 60/172,967, filed 
5 December 21, 1999. 

Background 

This invention relates to multi-channel audio origination and reproduction. 

Increasing demands for realistic audio reproduction from consumers and music 
10 professionals, and the abilities of modern compression technology to store and deliver 
multichannel audio at bit rates that are feasible, as well as current consumer trends, show 
n that multichannel (herein, more than two channels) sound is coming to consumer audio and 
^ the "home theater." Numerous microphone techniques, mixing techniques, and playback 
jU formats have been suggested, but a great deal of this effort has ignored the long-established 
r 715 requirements that have been found necessary for good perceived sound-field reproduction. 
E0 As a result, soundfield capture and reproduction remains one of the key research 
3 challenges to audio engineers. 

If 7 The main goal of soundfield reproduction is to reconstruct the spatial, temporal and 

M qualitative aspects of a particular venue as faithfully as possible when playing back in the 
;h20 consumer's listening room. Artisans in the field understand, however, that exact 
^ soundfield reproduction is unlikely to be achieved, and probably impossible to achieve, for 
basic physical reasons. 

There have been numerous attempts to capture the experience of a concert hall on 
recordings, but these attempts seem to have been limited primarily to the idea of either co- 
25 incident miking, which discards the interaural time difference, or widely spaced miking, 
which provides time cues that are not of the range 0 to ± 0.9 msec, and thus provide cues 
that are either not expected by the auditory system or constitute contradictory information. 
The one exception appears to be binaural miking methods, and their derivatives, which do 
two-channel recording and which attempt to take some account of human head shape and 
30 perception, but which create difficulties both in the matching of the "artificial head" or 
other recording mount, and which do not allow the listener to sample the soundfield by 

1 



small head movements. (Listeners unconsciously use small head movements to sample 
soundfields in normal listening environments.) 

In the realm of multichannel audio, current mixing methods consist of either co- 
incident miking (ambiphonics) or widely spaced miking (the purpose being to de-correlate 
5 the different recorded channels), neither of which provides both the amplitude and time 
cues that the human auditory system expects. 

Summary of the Invention 

Rather than capturing, and later reproducing, the exact soundfleld, the principles 
10 disclosed herein undertake to reconstruct the listener-perceived soundfleld. This is 
achieved by capturing the sound using a set of directional microphones that lie 
approximately on a sphere having a diameter of 0.9 ms sound travel. The 0.9 ms sound 
distance approximates the inter-aural time delay. Advantageously, one directional 
microphone points upward, one directional microphone points downward, and the 
15 remaining microphones (e.g., five of them) are arranged relatively evenly in the horizontal 
plane. On one embodiment, the signals from the microphones that point upward and 
downward are combined with the signals of the horizontal microphones before the signals 
of the horizontal microphones are recorded. 

20 Brief Description of the Drawings 

FIG. 1 presents an arrangement of microphones in accord with the principles of 
disclosed herein; and 

FIG. 2 illustrates a microphone sensitivity pattern of microphones used in the 
FIG. 1 arrangement. 

25 

Detailed Description 

In connection with human perception of the direction and distance of sound 
sources, a spherical coordinates system is typically used. In this coordinate system, the 
origin lies between the upper margins of the entrances to the listener's two ear canals. The 
30 horizontal plane is defined by the origin and the lower margins of the eye sockets. The 
frontal plane is at right angles to the horizontal plane and intersects the upper margins of 
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the entrances to the ear canals. The median plane (median sagittal plane) is at right angles 
to both the horizontal and frontal planes. In the context of this coordinate system, the 
angular position of an auditory event is described by y , which is the distance between the 
auditory event and the center of origin; 0 , which is the azimuth angle; and S , which is the 
5 elevation angle. 

Two cues provide the primary information for determining the angular position, y , 
of a source. These are the interaural time difference and the interaural level difference 
between the two ears. The direction from where the sound is perceived to be coming can 
be rotated about the axis passing through the ear canals to create a "cone of confusion" that 
10 describes where the sound may come from. The localization to the cone of confusion can 
be done by either time or level cues, or both. At low frequencies, the interaural time 
difference is directly detectable by the human auditory system. At frequencies above 2kHz 
%j to 3 kHz, this ability to synchronously detect the differences disappears, and the listener 
ll must rely, for time-stationary signals, on level differences created by the HRTF. For non- 
YjlS stationary signals that include a "leading edge", however, the ear is capable of using the 
Cu envelope of the signal as an interaural time difference cue, allowing both time and level 

cues even at high frequencies. 
H Most of the interaural level difference lies in the effect of the diffraction of the 

y ; sound wave around the listener's head. The sound shadow caused by the head is 
^120 particularly important when the sound's wavelength is close to, or smaller than, the size of 
O the head. Hence, the interaural level difference is frequency dependent; the shorter the 

wavelength (the higher the frequency), the greater the sound shadow and hence the larger 
the interaural level difference. As a result, interaural level difference works particularly 
well at high frequencies and is the main directional cue at high frequencies for signals with 
25 stationary, energy envelopes. The interaural level difference is also directionally variable 
in S , varying with the position of the sound source in azimuth, which helps disambiguate 
the information from the "cone of confusion." 

For sounds with a non-time-stationary energy envelope, the interaural time 
difference cue is not limited to low frequency signals detection. The ear is sensitive to the 
30 attacks and low frequency content in the envelope of complex sounds. In other words, the 
auditory system makes use of the interaural time difference in the temporal envelope of the 
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sounds in order to determine the location of a sound source. 

Particularly for sounds that happen to come from within the cone of confusion, the 
interaural time and level cues in general are not sufficient for three-dimensional sound 
localization. It is the binaural spectral characteristics of the signal due to head-related 
5 transfer functions (HRTFs) that help explain the human hearing mechanism when 

distinguishing between sound sources located in three-dimensional space, particular those 
located along a cone of confusion. When sound waves propagate in space and pass the 
human torso, shoulders, head and the outer ears (pinnae), diffractions occur and the 
frequency characteristics of the audio signals that reach the eardrum are altered. The 

10 spectral alternations of the input signals in different directions are referred to as the head- 
related transfer functions (HRTFs) in the frequency domain and head-related impulse 
response (HRIR) in the time domain. Because the wavelength of high frequencies is closer 
to the size of those small body parts, such as head and pinna, the spectral change in sounds 
is mostly limited to frequency components above 2 kHz. HRTFs vary in a complex way 

15 with azimuth, elevation, range and frequency. In general they differ from person to person 
as the amount of attenuation at different frequencies depends on the size and shape of the 
objects (such as pinna, nose and head) of the individual person. Head-related transfer 
functions are also directionally dependent and, for example, this usually causes more high 
frequency attenuation from sounds coming behind a person than those coming in front of 

20 the person. In general, there is a broad maximum near the ear canal resonance, 2 - 4kHz 
for sound sources located in the median-sagittal plane. For frequencies above 5 kHz, the 
HRTFs are characterized by a spectrum notch, which occurs at a frequency varying with 
the position of the sound source. When the source is below, the notch appears near 6 kHz. 
The notch moves to higher frequencies when the source is elevated. However, when the 

25 source is overhead, the HRTF has a relatively flat spectrum and the notch disappears. In 
this invention, the system advantageously uses, for the horizontal plane, the HRTF of the 
listening individual to a much greater extent than "auralization" techniques. If a situation 
exists where the placement of "up" and "down" loudspeakers exists, it would also be 
preferential to use same, however most consumer situations prevent this extension of the 

30 techniques from being practical at the present time. 

With this knowledge about the human auditory system, in accordance with the 
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principles of this invention, a sound is recorded with the notion of capturing the sound 
elements as they are perceived by the human auditory system. 

To that end, the sound-capturing arrangement disclosed herein employs a plurality 
of directional microphones that are arranged on a sphere having a diameter that 
approximately equals the distance that corresponds to the time that it takes a sound to 
travel from one ear to the other (approximately 0.9 msec). In this disclosure, this distance 
is referred to as the interaural sound delay. 

FIG. 1 depicts one embodiment of a sound recording arrangement in accord with 
the principles disclosed herein. It includes seven microphones that are positioned in space 
to lie on a sphere 10. These microphones are each directional microphones that will 
capture the sound from a particular direction, with the time delay between microphones 
being determined by the effective location of the microphone capsule inside the 
microphone body. Sphere 10 is not a physical element, of course. It is just a convenient 
means for describing the spatial position of the microphones. The origin of the sphere lies 
in the above-mentioned horizontal plane, which in FIG. 1 is labeled 20. One of the 
microphones, 31, is positioned to point upward, basically perpendicular to the horizontal 
plane; and another of the microphones, 32, is positioned to point downward, also basically 
perpendicular to the horizontal plane. The remaining microphones are arranged along the 
intersection of the horizontal plane and the sphere (which is a great circle). One of those 
microphones faces the direction that is considered the "front" (the direction at which a 
listener would be facing, if the listener were to replace the microphones), and the 
remaining microphones are arranged symmetrically about the midline. With five 
microphones facing horizontally, an acceptable arrangement places the microphones 72° 
apart. With seven microphones facing horizontally, an acceptable arrangement is ±45°, 
±90°, and ±150°. Although again, a center-front equal spacing will provide good results as 
well. 

The number of microphones used is not critical. One can use, for example, the five 
horizontally-facing microphones employed in the FIG. 1 arrangement, without the "up" 
and "down" microphones. Of course, the performance would suffer because these 
microphones detect the reflections off the ceiling and floor, respectively, and those 
reflections are significant contributors to spatial effects and to the sense of distance. It is 



advantageous, though, to have an odd number of microphones that face horizontally, with 
one facing the front, as mentioned above. It is also marginally acceptable to use fewer than 
five, and desirable to use more than five, microphones in the horizontal plane, if the 
consumer deliver mechanisms exist. A minimum of three microphones, aimed to the front 
5 of the listener, are required in any case, meaning that one microphone is directed at the 
direction at which a listener would be facing, and the other two microphones are aimed at 
angles ±ot<90° away from that direction, such as with angles ±a<30° or ±a<45°. 

FIG. 1 depicts distinct directional microphones 31 through 37 but, actually, it has 
been found that the reception pattern of those microphones is what plays a more important 
1 0 role than the number of microphones, and if the desired pattern is best realized with a 
collection of individual microphones, use of such a collection is clearly acceptable. For 
purposes of this disclosure, in fact, such a collection is considered as a single microphone. 

As for the desirable reception pattern, it can be like the one depicted in FIG. 2. 
This pattern is characterized by a primary (front) lobe that is down 3db by at a direction of 
1 5 the immediately neighboring microphone, and is down to effectively zero at a direction of 
the next-most immediate neighboring microphone (e.g., more than 40db down). This 
pattern depicts the sensitivity of the microphone to arriving sounds. The microphone is 
said to point to a direction, that being the direction at which the microphone's sensitivity is 
greatest. Since FIG. 2 depicts the five horizontal microphones arrangement of FIG. 1 
20 where the microphones are 72° apart, this requirement translates to a primary lobe that is 
down by 3 db at 72° and down to effectively zero at 144°. The microphones can also have 
a small back (possibly negative phase) lobe, but it is not required. 

There may be occasions when it is desirable to record all of the received sound 
channels; that is, the signals of all seven of the FIG. 1 microphones. For example, if a 
25 listener is. in a room that includes an ceiling speaker that faces down, and a floor speaker 
that faces up, both roughly above the listener's head and below the listener's feet, 
• respectively, then it is most advantageous to record the signals of microphones 3 1 -3 7 and 
to send the signal of microphone 31 to the ceiling speaker and the signal of microphone 32 
to the floor speaker. Conversely, when it is expected to employ the recorded signals in a 
30 room with only five speakers, and, therefore the signals of microphones 3 1 and 32 need to 
be combined with the other five signals, then it makes more sense to combine the signals 
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before storing, thereby saving on storage space. Of course, if the signals are merely 
transmitted to a remote location, the processing (i.e., combining) of signals can be done at 
the remote location. 

Because microphones 31 and 32 are placed appropriately for capturing the time 
5 delay according to the human head, they can be folded easily into the signals of 
microphones 33-37, using the equation 

^31 ~ ^31 + ^7 W31 ^32 / ' 

without further processing for HRTF and delay. If a superior result is desired, one 
can add some processing for both mike and listener's effective HRTF's, but this has been 
10 proven in practice to be very well approximated by the simple sum of components. 
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1. A sound recording arrangement comprising: 

a plurality of at least three microphones, with at least one pair of said microphones 
providing a sound time-of-arrival difference of approximately 0.9msec; and 

means for communicating signals of said microphones to other equipment 

5 

2* The arrangement of claim 1 where said means is output ports, recording 
apparatus, or a signal transmission apparatus. 

3. The arrangement of claim 1 where said microphones are directional. 

10 

4. The arrangement of claim 5 where said plurality of directional microphones, as a 
group, are more sensitive to sound arriving from a front direction of said arrangement than 

i from any other direction. 

5. The arrangement of claim 3 where a majority of said microphones point to 
directions lying substantially on a horizontal plane. 

6. The arrangement of claim 5 having at least one microphone additional 
microphone that is aimed at other than on said horizontal plane. 

7. The arrangement of claim 5 where one of said microphones that point to 
directions lying on a horizontal plane points to a front direction. 

8. The arrangement of claim 7 where others of said microphones that point to 

25 directions lying on a horizontal plane are paired up, and each pair j is pointed to directions 
±a„ where a t ^a } when j . 

9. The arrangement of claim 8 where sensitivity of a microphone that points at 
direction oc/ has a sensitivity at direction a I± \ that is down 3 db from said sensitivity at 

30 direction a/, and said sensitivity of a microphone that points at direction ot/ has a sensitivity 
at direction a I±2 that is more than 40db down from said sensitivity at direction a/. 
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10. The arrangement of claim 1 where said microphones are situated at effective 
positions in space that lie on a surface that approximates a sphere. 

5 11. The arrangement of claim 10 where said sphere has a diameter that 

corresponds to a time of sound travel of approximately 0.9msec. 

12. The arrangement of claim 1 where said a plurality of at least three microphones 
comprises a plurality of at least three microphones that point to directions on a horizontal 
10 plane, and an additional microphone that points at a direction that is substantially 
perpendicular to said horizontal plane. 

,1 13. The arrangement of claim 12 where said additional microphone points 

downward from said horizontal plane, or upward from said horizontal plane. 

IJ15 

J ri 14. The arrangement of claim 1 where said a plurality of at least three microphones 

comprises a plurality of at least three microphones that point at directions substantially on 
a horizontal plane, one additional microphone that points at a direction that is substantially 

=7 perpendicular and upward from said horizontal plane, and another additional microphone 

miO that points at a direction that is substantially perpendicular and downward from said 

f i horizontal plane. 

15. The arrangement of claim 14 further comprising a storage medium coupled to 
said means for communicating signals of said microphones to other equipment, for storing 

25 in a separate area the signals of each of said plurality of microphones. 

16. The arrangement of claim 14 further comprising a processor for combining 
selected ones of said signals of said plurality of at least three microphones. 

30 17. The arrangement of claim 16 where said processor develops a modified signal 
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s [=s.+ -1= (s + s ,) , for each signal s h of a microphone from said plurality of at least 

three microphones that points at a direction that lies substantially on said horizontal plane, 
where s u is the signal of said microphone that points substantially upward relative to said 
horizontal place, and said s d is the signal of said microphone that points substantially 
downward relative to said horizontal place. 

18. The arrangement of claim 1 where said plurality of at least three microphones 
comprises an odd number of microphones that point to directions that lie substantially on a 
horizontal plane. 

19. The arrangement of claim 18 where said plurality of at least three microphones 
3 comprises five microphones that point to directions 0°, ±72°, and ±144°. 

J 20. The arrangement of claim 18 where said plurality of at least three microphones 

01 5 comprises seven microphones that nominally point to directions 0°, ±45°, ±90°, and ±150°. 

* 21. An arrangement to reproduce sound from a plurality of channels, comprising: 

* an N plurality of input ports for receiving signals picked up by an N plurality of 
=5 microphones, where one of said microphones points at a direction that is substantially 
3 20 perpendicular to and upward from a horizontal plane and picks up signal s„ , another of 

said microphones points at a direction that is substantially perpendicular to and downward 
from said horizontal plane and picks up signal s d , and remaining N-2 of said microphones 
point at directions that substantially lie in said horizontal plane and pick up signals s' h ; and 
a processor for developing signals s' h ' , i=\ , 2, . . . N-2, such that 
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Abstract 

A sound-capturing arrangement uses a set of directional microphones that lie 
approximately on a sphere having a diameter of 0.9 ms sound travel, which approximates 
the inter-aural time delay. Advantageously, one directional microphone points upward, 
one directional microphone points downward, and the odd number of microphones are 
arranged relatively evenly in the horizontal plane. On one embodiment, the signals from 
the microphones that point upward and downward are combined with the signals of the 
horizontal microphones before the signals of the horizontal microphones are transmitted or 
recorded. 
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IN THE UNITED STATES 
PATENT AND TRADEMARK OFFICE 

Declaration and Power of Attorney 

As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below next to my name. 

I believe I am an original, first and joint inventor of the subject matter which is claimed 
and for which a patent is sought on the invention entitled Microphone Array for Preserving 
Soundfield Perceptual Cues the specification of which was filed on 12/21/99, as application 
Serial No. 60/172,967. 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claims, as amended by an amendment, if any, specifically referred to 
in this oath or declaration. 

I acknowledge the duty to disclose all information known to me, which is material to 
patentability as defined in Title 37, Code of Federal Regulations, 1.56. 

I hereby claim foreign priority benefits under Title 35, United States Code, 119 of any 
foreign application(s) for patent or inventors' certificate listed below and have also identified 
below any foreign application for patent or inventors' certificate having a filing date before that 
of the application on which priority is claimed: 

None 

I hereby claim the benefit under Title 35, United States Code, 120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States application in the manner provided by the 
first paragraph of Title 35, United States Code, 112, we acknowledge the duty to disclose all 
information known to us to be material to patentability as defined in Title 37, Code of Federal 
Regulations, 1 .56 which became available between the filing date of the prior application and the 
national or PCT international filing date of this application: 

None 

I hereby declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that these 
■ statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code and that such willful false statements may jeopardize the validity of the application or any 
patent issued thereon. 
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I hereby appoint the following attorney(s) with full power of substitution and revocation, 
to prosecute said application, to make alterations and amendments therein, to receive the patent, 
and to transact all business in the Patent and Trademark Office connected therewith: 



Samuel H. Dworetsky 


(Reg. No. 27873) 


Thomas A. Restaino 


(Reg. No. 33444) 


Michele L. Conover 


(Reg. No. 34962) 


Cedric G. DeLaCruz 


(Reg. No. 36498) 


Rohini K. Garg 


(Reg. No. 45272) 


Thomas M. Isaacson 


(Reg. No. 44166) 


Benjamin S. Lee 


(Reg. No. 42787) 


Robert B. Levy 


(Reg. No. 28234) 


Susan E. McGahan 


(Reg. No. 35948) 


Gary H. Monka 


(Reg. No. 35290) 


Jeffrey M. Navon 


(Reg. No. 32711) 


Stephen K. Pentlicki 


(Reg. No. 40125) 


Alfred G. Steinmetz 


(Reg. No. 22971) 







I also appoint the following as associate attorney(s), with full power to prosecute said 
application, to make alternations and amendments therein, and to transact all business in the 
Patent and Trademark Office connected therewith: 

Henry T. Brendzel (Reg. No. 26,844) 
William Ryan (Reg. No. 26,844) 

Please address all correspondence to Henry T. Brendzel, P.O. Box 574, Springfield, NJ 
0708 1 . Telephone calls should be made to Henry T. Brendzel at (973) 467-2025. 



Full name of joint inventor: James David Johnston 

Inventor's signature Date 

Residence: Morristown, Morris County, NJ 
Citizenship: USA 

Post Office Address: 12 Stonehenge Road 
Morristown, NJ 07960 



Full name of joint inventor: Eric R. Wagner 
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of the application on which priority is claimed: 
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I hereby claim the benefit under Title 35, United States Code, 120 of any United States 
application(s) listed below and, insofar as the subject matter of each of the claims of this 
application is not disclosed in the prior United States application in the manner provided by the 
first paragraph of Title 35, United States Code, 112, we acknowledge the duty to disclose all 
information known to us to be material to patentability as defined in Title 37, Code of Federal 
Regulations, 1.56 which became available between the filing date of the prior application and the 
national or PCT international filing date of this application: 

None 

I hereby declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief are believed to be true; and further that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
■ Code and that such willful false statements may jeopardize the validity of the application or any 
patent issued thereon. 
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I hereby appoint the following attorney(s) with full power of substitution and revocation, 
to prosecute said application, to make alterations and amendments therein, to receive the patent, 
and to transact all business in the Patent and Trademark Office connected therewith: 



Samuel H. Dworetsky 


(Reg. No. 27873) 


Thomas A. Restaino 


(Reg. No. 33444) 


Michele L. Conover 


(Reg. No. 34962) 


Cedric G. DeLaCruz 


(Reg. No. 36498) 


Rohini K. Garg 


(Reg. No. 45272) 


Thomas M. Isaacson 


(Reg. No. 44166) 


Benjamin S. Lee 


(Reg. No. 42787) 


Robert B. Levy 


(Reg. No. 28234) 


Susan E. McGahan 


(Reg. No. 35948) 


Gary H. Monka 


(Reg. No. 35290) 


Jeffrey M. Navon 


(Reg. No. 32711) 


Stephen K. Pentlicki 


(Reg. No. 40125) 


Alfred G. Steinmetz 


(Reg. No. 22971) 







I also appoint the following as associate attorney(s), with full power to prosecute said 
application, to make alternations and amendments therein, and to transact all business in the 
Patent and Trademark Office connected therewith: 

Henry T. Brendzel (Reg. No. 26,844) 
William Ryan (Reg. No. 26,844) 

Please address all correspondence to Henry T. Brendzel, P.O. Box 574, Springfield, NJ 
07081. Telephone calls should be made to Henry T. Brendzel at (973) 467-2025. 



Full name of joint invent^ Jame^avW Johnston 
Inventor's signature 




Date 




Residence: Morrisjfc 
Citizenship: US^ 
Post Office Address: 12 Stonehenge Road 
Morristown, NJ 07960 



Full name of joint inventor: Eric R. Wagner 

Inventor's signature Date 

Residence: South Plainfield, Middlesex County, NJ 
Citizenship: USA 

Post Office Address: 400 Rahway Avenue 
South Plainfield, NJ 0708 1 



